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2a)S This action is FINAL. 2b)D This action is non-final. 
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Disposition of Claims 
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6) I3 Claim(s) 1-25 is/are rejected. 
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Applicant may not request that any objection to the drawing(s) be held in abeyance. See 37 CFR 1 .85(a). 
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DETAILED ACTION 
Response to Amendment 

1 . In response to the office action from 10/28/2005, the applicant has submitted a request 
for continued examination, filed 12/19/2005, arguing to traverse the art rejection based on the 
limitation regarding (Amendment, Pages 8-10). Applicant's arguments have been fully 
considered, however the previous rejection is maintained due to the reasons listed below in the 
response to arguments. 

Response to Arguments 

2. Applicant's arguments have been fully considered but they are not persuasive for the 
following reasons: 

With respect to Claims 1, 13, and 25, the applicant argues that the prior art of record 
fails to teach the step for ranking linguistic features (Amendment, Pages 9-10). Specifically, the 
applicant notes that it is incorrect to insert the linguistic features of the Eide et al reference ("A 
Linguistic Feature Representation of the Speech Waveform, " 1993) into the calculations recited 
in the Nouza et al reference ( n Feature Selection Methods for Hidden Markov Model-based 
Speech Recognition") (Amendment, Pages 9-10). In response, the examiner notes that Nouza 
teaches the concept for ranking speech objects within a Hidden Markov Model (HMM) based on 
a measure of object similarity (Pages 186-187), while Eide teaches objects within a Hidden 
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Markov Model (Markov source) as linguistic feature observations. When taken in combination, 
using the linguistic features taught by Eide for the distinguishing objects within the HMM for 
taught by Nouza to calculate object similarity in feature ranking, provides the benefits of 
improved speech recognition and keyword spotting (Eide, Pages 485-486). Also such a 
combination is valid because the parameters taught by both references are used to distinguish 
speech as a part of HMM models (Nouza, parameters distinguishing models of speech objects, 
Page 187; and Eide, distinctive speech features, Page 483) and Nouza provides no negative 
teachings regarding the incorporation of linguistic features. Thus, Claims 1, 13, and 25 remain 
rejected. 

In response to applicant's argument that the references fail to show certain features of 
applicant's invention, it is noted that the features upon which applicant relies (i.e., linguistic 
feature calculations processed based on the absence or presence of the features, Amendment, 
Page 9) are not recited in the rejected claim(s). Although the claims are interpreted in light of 
the specification, limitations from the specification are not read into the claims. See In re Van 
Geuns, 988 F.2d 1181, 26 USPQ2d 1057 (Fed. Cir. 1993). 

The dependent claims further limit their rejected independent claims, and thus, also 
remain rejected. 

Claim Rejections - 35 USC §103 



3. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 
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(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

4. Claims 1- 24 are rejected under 35 US.C. 103(a) as being unpatentable over Nouza 
( "Feature Selection Methods for Hidden Markov Model-based Speech Recognition ") in view of 
Eide et al ("A Linguistic Feature Representation of the Speech Waveform, " 1993), and further in 
view of De Souza et al (U.S. Patent: 5,884,261). 

With respect to Claims 1 and 13, Nouza recites: 

Obtaining speech input data (HMM and DTW speech recognition systems, Page 188, Col 
1, Lines 5-7; Inherently, speech data would have to be received in order for speech to be 
recognized by the recognition system.) ; 

Building a model for each feature of an original set of features (parameters used to 
distinguish models of different speech objects in the form of Gaussian mixture pdfs, Page 187, 
Col 1, Lines 5-9, and evaluated for individual feature contributions for speech unit 
classification, Page 188, Col. 1, Lines 11-13); 

Ranking the features (feature significance factor that can be used for ordering features, 
Page 188, Col. 1, last paragraph - Col 2, first paragraph) ; 

Nouza does not teach the use of linguistic features in building speech models for 
recognition, however, Eide discloses a method for creating speech recognition models using 
speech features that have been linguistically classified (Pages 483-484, Section 1, and Table 1), 

Nouza and Eide are analogous art because they are from a similar field of endeavor in 
speech recognition feature processing. Thus, it would have been obvious to a person of ordinary 
skill in the art, at the time of invention, to combine the use of linguistic features in the creation of 
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a speech model for recognition as taught by Eide with the speech recognition system utilizing 
feature selection as taught by Nouza to improve recognition accuracy through contextual 
information provided by linguistic features, thus implementing a means of keyword spotting 
(Eide, Page 485-486, Section 3 and Table 5). 

Neither Nouza nor Eide explicitly teach the additional step of rebuilding the model for 
each of a preselected number of ranked features, however De Souza discloses a means for 
updating speech recognition model arcs utilizing speech features having a highest likelihood of 
an acoustic match (Col. 16, Lines 15-59). 

Nouza, Eide, and De Souza are analogous art because they are from a similar field of 
endeavor in speech recognition feature processing. Thus, it would have been obvious to a person 
of ordinary skill in the art, at the time of invention, to modify the teachings of Nouza in view of 
Eide with the ability to rebuild speech recognition model arcs utilizing speech features having a 
highest likelihood of an acoustic match as taught by De Souza in order to implement more 
accurate speech recognition by creating updated models which best represent speech feature data 
(De Souza, Col 16, Lines 38-42). 

With respect to Claims 2 and 14, Nouza further discloses: 

The method and apparatus according to claims 1 and 13, respectively, wherein said step 
of building a model for each of a pre-selected number N of the ranked features comprises 
building a model for the top N ranked features (reducing the size of feature vectors used in 
speech processing, Abstract, Lines 5-6, feature significance factor that can be used for ordering 
features, Page 188, Col. 1, last paragraph - Col. 2, first paragraph, and identifying correct and 
incorrect speech models based upon those features, Page 188, Lines 26-32. It would be 
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inherent, upon selection of principal components with the largest amount of variance from an 
ordered component set used for separating hypothesis choices, that the principal component 
features being of a highest likelihood to represent a particular state within a HMM be used to 
remodel a most likely HMM candidate for speech recognition.) 

With respect to Claims 3 and 15, Nouza additionally recites: 

The method and apparatus according to claims 1 and 13, respectively, further comprising 
the step of compiling a confusion matrix for each feature of the original set of features 
subsequent to said step of building a model for each feature of an original set of features 
(covariance matrix used to evaluate the contributions of a feature in speech classification and to 
determine whether a particular speech model is correct or incorrect, Page 188, Col 1, Lines 11- 
32). 

With respect to Claims 4, 5, 16, and 17, Nouza further discloses: 
The method and apparatus according to claims 3 and 15 and claims 4 and 16, 
respectively, wherein said step of compiling a confusion matrix comprises computing a score for 
each feature based on the likelihood, as a log-likelihood as per Claim 5, of its presence in a frame 
of the speech input data (contribution of a feature within a covariance matrix in identifying a 
speech unit using a particular speech model, which is represented by a log-likelihood score, 
Page 188, Col 1, Lines 13-23). 

With respect to Claim 6 and 18, Nouza teaches the method and system of feature 
selection in recognizing a speech unit, utilizing a confusion matrix used to evaluate the 
contributions of a feature in speech classification and to determine whether a particular speech 
model is correct or incorrect as applied to Claims 3 and 15, while De Souza teaches the means of 
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rebuilding speech recognition model arcs utilizing speech features having a highest likelihood of 
an acoustic match, as applied to Claims 1 and 13. Neither Nouza nor De Souza specifically 
teaches comparing likelihood scores to a predetermined threshold as a means of detecting 
whether a speech feature is useful in picking a correct classification however, Eide discloses: 

Compiling a confusion matrix further comprises comparing each score of each feature 
with a threshold (detection of a particular linguistic feature within a phoneme that would 
inherently require some type of threshold comparison to determine the presence of such a 
feature, Pages 484-484, Section 1 and Tables 1-4). 

Nouza, Eide, and De Souza are analogous art because they are from a similar field of 
endeavor in speech recognition feature processing. Thus, it would have been obvious to a person 
of ordinary skill in the art, at the time of invention, to modify the teachings of Nouza and De 
Souza with the use of a threshold comparison in determining the presence of a particular 
linguistic feature within a phoneme as suggested by Eide to provide a well-known and 
convenient means of detecting if a linguistic feature is present in picking a correct phoneme 
classification through threshold comparison. Therefore, it would have been obvious to combine 
Eide, Nouza, and De Souza for the benefit of detecting the presence of a particular linguistic 
feature for phoneme classification. 

With respect to Claims 7 and 19, Eide additionally discloses: 

Calculating mutual information between truth and labels for each feature (determination 
of the absence or presence of a particular speech feature designated by a "+ " or " in 
phoneme classification, Pages 483-484, Section 1, and Tables 1-4), 

With respect to Claims 8 and 20, Eide further recites: 
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Ranking the mutual information calculated in compiling the confusion matrix 
(determination of the most-likely linguistic classes used to describe a phoneme, which would 
inherently require a step of linguistic feature ranking, Pages 483-484, and Tables 1-4). 
With respect to Claims 9, 11, 21, and 23, Eide additionally recites: 
Partitioning the speech input data in parallel, once fore each linguistic feature (dividing 
speech training data according to linguistic feature truth labels, Page 483, Section 1)\ and 
Producing an observation vector (calculating attribute vectors, Page 483 ', Section 1). 
With respect to Claims 10, 12, 22, and 24, Eide further discloses: 
Portioning data in parallel from the observation vector, once for each feature (dividing 
attribute vectors into feature-present and feature-absent sets for all linguistic features, Page 484, 
Section 1); and 

Producing final observations (final determination of whether a particular linguistic 
feature is present or absent in speech training data, Page 484, Section 1). 

5. Claim 25 is rejected under 35 U.S.C. 103(a) as being unpatentable over Nouza ( ''Feature 
Selection Methods for Hidden Markov Model-based Speech Recognition ") in view of Eide et al 
("A Linguistic Feature Representation of the Speech Waveform, " 1993), further in view of De 
Souza et al (U.S. Patent: 5,884,261), and yet further in view of the applicant's admitted prior 
art. 

With respect to Claim 25, Nouza in view of Eide, and in further view of De Souza 
teaches the method of linguistic feature selection in building a speech recognition unit as applied 
to Claim 1. While Nouza in view of Eide, and in further view of De Souza, does not teach it, it is 
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the applicant's admitted prior art to implement the method taught by Nouza in view of Eide 
using a computer program contained on a computer storage device, since computers are 
conveniently used and their programs easily updated for performing speech recognition 
operations, while a storage device would offer a means of storing any training databases or other 
necessary stored information. Therefore, it would have been obvious to one of ordinary skill in 
the art, at the time of invention, to implement a linguistic feature selection method for 
recognition model building using a computer program transferable between various machines 
through the use of a storage device, thus increasing method adaptability, to obtain the invention 
as specified in Claim 25. 

Conclusion 

6. THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of time 
policy as set forth in 37 CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within TWO 
MONTHS of the mailing date of this final action and the advisory action is not mailed until after 
the end of the THREE-MONTH shortened statutory period, then the shortened statutory period 
will expire on the date the advisory action is mailed, and any extension fee pursuant to 37 
CFR 1 .136(a) will be calculated from the mailing date of the advisory action. In no event, 
however, will the statutory period for reply expire later than SIX MONTHS from the mailing 
date of this final action. 
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7. The prior art made of record and not relied upon is considered pertinent to applicant's 
disclosure: 

Kirchhoff ("Robust Speech Recognition Using Articulator)? Information, " 1999)- teaches 
a method for selecting N-best linguistic features. 

8. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to James S. Wozniak whose telephone number is (571) 272-7632. 
The examiner can normally be reached on M-Th, 7:30-5:00, F, 7:30-4, Off Alternate Fridays. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, David Hudspeth can be reached at (571) 272-7843. The fax phone number for the 
organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). 



James S. Wozniak 
2/3/2006 



DAVID HUDSPETH 
SUPERVISORY PATENT EXAMINER 
TECHNOLOGY CENTER 2600 



